Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Rc peakcall 2797 #1478

Open
wants to merge 81 commits into
base: develop
Choose a base branch
from
Open

Rc peakcall 2797 #1478

wants to merge 81 commits into from

Conversation

rsc3
Copy link
Collaborator

@rsc3 rsc3 commented Jan 14, 2025

Description

Added the peakcalling step in ATAC.

  • This currently does not assign any outputs. (Done)
  • Currently peakcalling is turned on. (Done)
  • Validated by running on human and mouse scientific data.

Checklist

If you can answer "yes" to the following items, please add a checkmark next to the appropriate checklist item(s) and notify our WARP team by tagging @broadinstitute/warp-admins in a comment on this PR.

  • Did you add inputs, outputs, or tasks to a workflow?
  • Did you modify, delete or move: file paths, file names, input names, output names, or task names?
  • If you made a changelog update, did you update the pipeline version number?

Copy link

🔍Changelog Validation Results:

Comparing changelogs for pipelines that differ from the versions on 'origin/develop':
All changelog files are valid for this release.

Copy link

🔍Version Validation Results:

Comparing versions and changelogs for pipelines that differ from the versions on 'origin/staging':
All WDLs and changelog files appear to be valid for this release.

Copy link

Remember to squash merge!

Copy link

🔍Changelog Validation Results:

Comparing changelogs for pipelines that differ from the versions on 'origin/develop':
All changelog files are valid for this release.

Copy link

🔍Version Validation Results:

Comparing versions and changelogs for pipelines that differ from the versions on 'origin/staging':
All WDLs and changelog files appear to be valid for this release.

atac_data_mod.uns[key] = atac_data_mod.uns[key].to_pandas()

print("Write into h5ad file")
atac_data_mod.write_h5ad("~{bam_base_name}.peaks.h5ad")
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
atac_data_mod.write_h5ad("~{bam_base_name}.peaks.h5ad")
atac_data_mod.write_h5ad("~{bam_base_name}.bin.h5ad")


print("Write into h5ad file")
atac_data_mod.write_h5ad("~{bam_base_name}.peaks.h5ad")
peak_matrix.write_h5ad("~{bam_base_name}.matrix.h5ad")
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
peak_matrix.write_h5ad("~{bam_base_name}.matrix.h5ad")
peak_matrix.write_h5ad("~{bam_base_name}.peaks.h5ad")

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

changed to cellbybin


output {
File peaks_h5ad = "~{bam_base_name}.peaks.h5ad"
File matrix_h5ad = "~{bam_base_name}.matrix.h5ad"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
File matrix_h5ad = "~{bam_base_name}.matrix.h5ad"
File matrix_h5ad = "~{bam_base_name}.bins.h5ad"

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

changed to cellbypeak

Copy link

Remember to squash merge!

Copy link

🔍Version Validation Results:

Comparing versions and changelogs for pipelines that differ from the versions on 'origin/staging':
All WDLs and changelog files appear to be valid for this release.

Copy link

🔍Changelog Validation Results:

Comparing changelogs for pipelines that differ from the versions on 'origin/develop':
All changelog files are valid for this release.

# peak calling using SnapATAC2
task PeakCalling {
input {
File bam
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

maybe take the bam out

Copy link

Remember to squash merge!

Copy link

🔍Changelog Validation Results:

Comparing changelogs for pipelines that differ from the versions on 'origin/develop':
All changelog files are valid for this release.

Copy link

🔍Version Validation Results:

Comparing versions and changelogs for pipelines that differ from the versions on 'origin/staging':
All WDLs and changelog files appear to be valid for this release.

Copy link

Remember to squash merge!

Copy link

🔍Changelog Validation Results:

Comparing changelogs for pipelines that differ from the versions on 'origin/develop':
All changelog files are valid for this release.

Copy link

🔍Version Validation Results:

Comparing versions and changelogs for pipelines that differ from the versions on 'origin/staging':
All WDLs and changelog files appear to be valid for this release.

Copy link

Remember to squash merge!

Copy link

🔍Changelog Validation Results:

Comparing changelogs for pipelines that differ from the versions on 'origin/develop':
All changelog files are valid for this release.

Copy link

🔍Version Validation Results:

Comparing versions and changelogs for pipelines that differ from the versions on 'origin/staging':
All WDLs and changelog files appear to be valid for this release.

}

output {
File peaks_h5ad = "~{output_base_name}.cellbybin.h5ad"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we adjust output variable name as well?

output {
File bam_aligned_output = bam_aligned_output_atac
File fragment_file = fragment_file_atac
File fragment_file_index = fragment_file_index_atac
File snap_metrics = snap_metrics_atac
File library_metrics_file = library_metrics
File? peakcall_h5ad_file = PeakCalling.peaks_h5ad
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we add both peak calling output files? The cellbybin matrix and the cellbypeak matrix?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants